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&p! Abstract: Different methods for the calculation of cross sections with many QCD 

particles are compared. To this end, CSW vertex rules, Berends-Giele 
recursion and Feynman-diagram based techniques are implemented as 
, well as various methods for the treatment of colours and phase space 

integration. We find that typically there is only a small window of jet 
multiplicities, where the CSW technique has efficiencies comparable 
or better than both of the other two methods. 
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In the past years, a variety of string-inspired methods has been proposed for the efficient calculation 
of QCD scattering amplitudes with large number of external legs [UEKH]. Compared with tech- 



C\| I niques based on the construction of Feynman diagrams and corresponding helicity amplitudes [3j , 
' these new methods induce a relatively mild growth in computational complexity with increasing 
. multiplicity. A particularly interesting new technique are the CSW vertex rules, which are based 
I on the correspondence between a weakly coupled = 4 Super Yang-Mills theory and a certain 
■ type of string theory. It has been shown in [1] that this method allows to build arbitrary tree-level 
op . colour-ordered amplitudes from MHV amplitudes. In the context of this publication, the CSW 
rules have been implemented into the Sherpa framework [5j. The corresponding code has been 
validated by a comprehensive comparison of cross sections with other programs for the processes 
^ ■ pp — > jets and pp Z-|-jets. For Monte Carlo phase space integration, the standard techniques 
presented in [S] in the implementation of AMEGIC++ [7] are employed. In pure QCD processes the 
HAAG generator [8] is used, which has been modified such that it can be employed with adaptive 
techniques hke VEGAS 

The aim of this publication is to address the issue of efficiency of the CSW technique when deal- 
ing with full cross sections, including summation over colours and helicities, rather than single 
amplitudes. This extends previous studies presented in [inilll]. For purely gluonic processes, it 
has already been pointed out in [11] that Berends-Giele type recursive relations are superior at 
large final state multiplicities. This is mainly due to the fact that the colour decomposition (colour 
ordering) of QCD amplitudes implies a large exponential growth in the number of required partial 
amplitudes, which can, in the case of the Berends-Giele approach, be overcome with the technique 
of colour dressing. It was however also noted there that colour-ordered multi-gluon amplitudes 
for relatively low final state multiplicities (< 5) are most efficiently computed using CSW vertex 
rules. This is because in this regime the number of possible colour configurations is still low. In 
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addition, there is also only a small number of contributing non-MHV amplitudes, such that the 
major part of the calculation involves one MHV vertex only, maximising the impact of the compact 
formulae in the CSW approach. On the other hand, for low multiplicities expressions obtained 
from traditional Feynman-diagram based techniques are also compact and can easily be simpli- 
fied. The natural question arises, whether those methods - Berends-Giele recursion and traditional 
Feynman-diagram based techniques - eventually perform comparably or even better than the CSW 
vertex rules. 

In this publication we aim at quantifying the effects outlined above for cross sections with realistic 
cuts and for some experimentally significant processes. To this end, we compare three different 
numerical programs: 

1. AMEGIC++ [7], which computes cross sections using helicity amplitudes [4J, constructed 
through Feynman diagrams; 

2. the new generator COMIX [12J based on colour-dressed Berends-Giele (CDBG) recursion jllj : 

3. and a new, optimised implementation of CSW rules in the framework of Sherpa. 

Each of the methods prefers a certain type of phase space integrator. Colour-ordered amplitudes are 
employed in the new implementation of the CSW technique and can be constructed in COMIX, such 
that both codes can employ the colour sampling integrator presented in [12]. Since both AMEGIC++ 
and Comix have access to internal propagator lines the techniques of [6] can naturally be employed, 
since they are based on the assumed pole structure of single diagrams. Using AMEGIC++ to extract 
corresponding integration channels from the Feynman diagrams, this method can also be ported 
to be used with the CSW rules. 

We attempt to compare as many generator-integrator pairings as possible for a number of processes 
that will be relevant at the LHC. The outline is as follows. In Sec. [2] the algorithms needed for 
the implementation of the CSW technique are briefiy summarised. The two other matrix element 
generators have been presented in [7Hl2j and we refer to the original publications for further details. 
We do, however, comment on some relevant details of the colour treatment and the phase space 
integration, which have not been published before. In Sec. [3] the programs are validated and their 
respective efficiency in terms of evaluation time per configuration and in terms of integration time 
to reach a certain precision level is discussed. 

2 Implementation 

In this section, we briefiy introduce the basic ingredients for the numerical implementation of the 
CSW rules. Phase space integration algorithms are presented in detail in [7tll2]. therefore only the 
modified HAAG algorithm will be discussed here, cf. App. 12.31 

2.1 Colour factors 

In our implementation we employ the CSW vertex rules for colour-ordered amplitudes; therefore 
the colour structure and the kinematical part of the amplitudes factorise. For QCD amplitudes 
containing quarks the well-known decomposition over fundamental representation matrices Tf^ of 
S'[/(3) jT3] is used. Considering for example one quark line connecting the external quarks 1 and n 
with colours i\ and J„ with the intermediate gluons carrying colours Oj, this decomposition reads 




n) . 



(1) 



(TGSn-2 
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The sum runs over all (n — 2)! permutations of (2, . . . ,n — 1). For purely gluonic amplitudes the 
numerically more efficient representation through matrices (-F")bc = if"'^'^, i-e. the adjoint represen- 
tation, [T5| is employed. Assuming n external gluons carrying colours oi . . . this decomposition 
reads 

A{l,...,n)= J2 {F'"'^F'"'^---F'"'"-^)a,a^A{l,a2,...,an-i,n), (2) 

cr&Sn-2 

where again the sum runs over all (n — 2)! permutations of (2, . . . , n — 1). 



2.2 Amplitude evaluation 

The first non-vanishing helicity configurations for tree amplitudes are the MHV or Parke- Taylor 
amplitudes [15]. They contain n — 2 partons with like-sign helicity and 2 with opposite sign helicity. 
Amplitudes with all or all but one like-sign helicity vanish at tree- level, cf. The MHV (MHV) 
amplitudes can be written as simple holomorphic (anti-holomorphic) functions. In the notation 
of [iHj the n-gluon MHV-amplitudes read 



^n(l^5-")^ , n^) — ^ ,^ „, 1^ „> 7 - TT TT ■ (3) 

(1 2)(2 J) . . . (n-1 n)(n 1) 

MHV amplitudes for one external quark pair are easily obtained using supersymmetric Ward iden- 
tities [ITJ. They are given by 

A(,+ ,2+,...,fc-,...,n-l+,g-) 



(12)... (n-ln)(nl) 
, {kq)\kq) 
^ (12) ... (n-ln)(nl) 



...3/,-. (4) 

A„(g-,2+,...,A:-,...,n-l+,g+) 



Corresponding MHV amplitudes are obtained by complex conjugation, amounting to the replace- 
ment ( ) ^ [ ] • Amplitudes for two external quark pairs and for the scattering into a Drell-Yan 
lepton pair have been discussed in [18]. 



The CSW vertex rules to build full amplitudes from MHV (MHV) amplitudes read [T]: 

• For an amplitude An (l'*^ , . . . , n'*") with 2 < m < n — 2 negative helicity external legs, draw 
all possible diagrams connecting m — 1 MHV (MHV) amplitudes. Their free legs connect to 
the external particles. 

• Construct spinors for internal lines from the corresponding momenta p via = p^jji]^, where 
•q is an arbitrary but fixed reference spinor employed in all CSW diagrams. 

• Associate a scalar propagator 1/p^ with each internal line connecting two MHV vertices. 



Using this algorithm, amplitudes with an arbitrary number of negative (positive) helicity partons 
can be computed. 



2.3 Phase space integration using Haag 

Several approaches have been presented in the past to efficiently sample multi-particle phase spaces 
[SP^|I5] . For multi-jet phase space integration in pure QCD processes, the most advanced algorithm 
so far is the HAAG generator presented in [8]. It is designed to produce momenta approximately 
following a QCD antenna function, cf. Sec. |X] 
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Symmetrisation of antennae 

In analogy to the antenna function in Eq. ([5]), each single HAAG channel can be labeled by a 
specific permutation of the momenta. Since the algorithm always starts from incoming momenta, 
the channels are invariant with respect to cyclic permutations. However, unlike the underlying 
antenna function itself, different HAAG channels are obtained, if the order of the momenta is 
reversed. This is due to the fact that the open antenna algorithm is employed. In order to recover 
symmetry, pairs of channels given by a permutation and its reverse are combined into one, i.e. one 
of the two configurations is chosen with equal probability and the weight is given by the average of 
the two. Furthermore, all antennae can be classified into different types depending on the relative 
position of the incoming momenta (po and pi) within a permutation of the momenta. All channels 
of the same type, i.e. with the same number of final state momenta between pQ and pi, are in 
principle equivalent. They can be obtained from each other by simply relabeling the final state 
momenta. 

Improvements of the algorithm 

To generate an adequate phase space integrator for realistic n-particle QCD processes, different 
HAAG channels can be combined using the multichannel method [20j . cf. [8|. 

The efficiency of the integrator is improved, if additionally the VEGAS algorithm [9j is applied to 
optimise individual single channels VEGAS is very efficient in adapting to functions, whose peaking 
behaviour is not too extreme and which factorise into a product of one-dimensional functions. 
Although this is not necessarily the case for a fully differential cross section, VEGAS can be used to 
better adapt the antenna-like structures in single HAAG-channels to the corresponding substructures 
in the matrix elements, including phase space cuts. 

The equivalence of HAAG-channels of the same type can also be used such that all of them em- 
ploy the same VEGAS map. This alleviates the adaptation significantly, since only very few maps 
survive, with a number that grows only linearly with the number of particles. This can easily 
be understood from the construction of the HAAG channels from a factorially growing number of 
equivalent mappings. 

3 Results 

Various approaches can be used to judge the efficiency of methods to evaluate amplitudes, to 
sum them over colours and helicities and to integrate them over phase space. Finally, however 
it is important that the numerical code which implements the method yields a cross section with 
the desired error as quickly as possible. Theoretically appealing forms of the amplitude are not 
guaranteed to be of maximal help in this respect, see for example 

Therefore, the following strategy to judge the various methods is adopted: 

1. The evaluation times for helicity summed amplitudes are compared. Two different sets are 
generated, which correspond to a colour-summed and a colour-sampled integration, respec- 
tively. In this publication, helicity sampling is not considered, since it introduces additional 
degrees of freedom which in most cases significantly slow down the integration. For low mul- 
tiplicities, this effect is not compensated by the correspondingly lower number of amplitudes 
that have to be evaluated, cf. |12j . 

2. A number of cuts is imposed on the final state to yield physical cross sections for various 
processes. In each case, the integration is terminated once a certain precision of the result has 
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been reached. Note that this effectively tests not only the method for evaluating amplitudes, 
but also the phase space sampling and optimisation. This study corresponds exactly to the 
problem outlined above and can therefore provide vital information about a preference for 
which technique/code to use for the evaluation of cross sections in different regimes of particle 
multiplicity. 

In the following, results obtained with the new implementation of the CSW technique are labeled 
"CS W" , the helicity methods implemented in AMEGIC++ are labeled by "AMEGIC" and the colour- 
dressed Berends-Giele recursion implemented in COMIX is denoted by "COMIX". 

3.1 Amplitude evaluation times 

Colour-summed amplitudes are compared in Tab. [TJ The colour-dressed Berends-Giele recursion 
is not included in this comparison, since it is optimised to generate minimal subsets of colour- 
interfering amplitudes; it would thus be very inefficient for colour-summation. The results compiled 
here clearly show the superiority of the CSW method over the evaluation of Feynman diagrams for 
pure QCD process with no or only one quark-line. In these two cases the expressions obtained from 
CSW rules are very compact, leading to an increasingly better behaviour for larger numbers of final 
state particles. Additionally, the colour-decomposition in the adjoint representation [H] used for the 
CSW amplitudes seems to significantly simplify the computation for purely gluonic amplitudes. For 
two quark lines the two methods are comparable in processes with lower multiplicities. For higher 
multiplicities the standard helicity method, however, quickly leads to unmanageable expressions. 
For processes involving electroweak interactions there is no obvious advantage in using CSW rules. 

To compare evaluation times for colour-ordered amplitudes, matrix elements generated by tradi- 
tional methods in AMEGIC++ are not included, since the projection of the colour structures coming 
from the direct evaluation of full QCD Feynman rules onto colour-ordered amplitudes would not 
lead to a significant simplification compared to the full colour-summed expression. Therefore the 
results for colour-ordered amplitudes shown in Tab. [2] contain only CSW and Berends-Giele recur- 
sion techniques. Obviously, the former is preferred over the latter for multiplicities that require 
the evaluation of MHV contributions only, i.e. processes with at most five particles involved. Once 
next-to-MHV contributions kick in (six and seven particles involved) both methods exhibit a sim- 
ilar performance, beyond that the Berends-Giele technique is clearly the method of choice. This 
corresponds to what has been found in [Tl] for helicity-sampled matrix elements. It should be 
noted, however, that when dressing amplitudes with colour and calculating corresponding matrix 
elements which include subleading colour contributions the colour-dressed Berends-Giele method 
gains additional performance compared to CSW rules, cf. 

3.2 Integration times 

To investigate the integration behaviour of the various codes, the time needed to compute cross 
sections is compared for example processes at the LHC with a centre-of-mass energy of 14 TeV. 
The setup is essentially identical to the one employed in [21]. Respective settings are listed in Tab. 

El 

A variety of phase space generators is used for the integration, which all roughly reproduce the 
peaking behaviour of the matrix elements and which are to some extend adaptive. For colour- 
summed matrix elements we use the improved version of the HAAG generator ("HAAG") or an 
integrator using the standard multi-channel integration technique with channels constructed ac- 
cording to [B], (labeled "MC"). Their usage with colour-sampled matrix elements, however, turns 
out to be quite inefficient since they do not take into account that different colour assignments give 
rise to enhanced contributions in different phase space regions. For colour-sampled multi-gluon 
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scattering a special integrator has been constructed which is denoted by "CSI" (Colour Sampling 
Integrator). It is based on the channels given by the HAAG algorithm, which are selected and 
weighted specific to a corresponding colour assignment. For matrix elements generated by COMIX, 
a recursive phase space generator (denoted "RPG") is used which adapts the recursive structure of 
the Berends-Giele formalism and is thus applicable for very high particle multiplicities. The latter 
two integrators are described in detail in |12j . 

The first class of processes is pure jet production with up to 6 jets in the final state, for different 
maximal numbers of quarks. Tab. U] lists the results for gluon scattering processes. For up to four 
final state particles colour-summed CSW matrix elements combined with HAAG are most efficient. 
Beyond that the number of colour configurations becomes large enough to render the computation 
of colour-summed matrix elements a cumbersome, time-consuming exercise. In this region CDBG 
matrix elements paired with the CSI integrator give the best performance. In Tab. [5] multi-jet 
cross sections with one and two quark lines are studied, the conclusions are essentially the same as 
in the purely gluonic case. 

The second class of processes is given by the production of a lepton pair accompanied by additional 
jets, with results shown in Tab. [H In this case, only the "MC" and "RPG" integrators have been 
employed. Here, the best performance is achieved with colour-summed matrix elements for up to 
three jets. As stated before the CSW rules do not lead to any improvement. 

4 Conclusions 

In this publication a comparison between different matrix element generators, paired with different 
integration techniques has been presented. All of them are at present implemented in the framework 
of the event generator Sherpa. 

Considering the evaluation of matrix elements, it has been shown that for low multiplicities tra- 
ditional techniques to evaluate Feynman diagrams in the helicity formalism perform surprisingly 
well. However, with growing numbers of external legs, these methods quickly lead to unmanageably 
large expressions. The compact formulae for MHV amplitudes result in a significant gain with the 
CSW technique, once QCD amplitudes predominantly containing gluons are concerned. Otherwise, 
the method performs comparably or sometimes even worse than techniques based on Feynman di- 
agrams. With growing numbers of external legs, the colour dressed Berends-Giele recursion is the 
candidate best-suited to quickly evaluate matrix elements. 

Considering the various phase space integrators, it has been shown that traditional methods based 
on the expected pole structure of the integrand as guessed from Feynman diagrams or similar are 
not only versatile enough to yield an appreciable convergence of results for QCD processes. There, 
dedicated algorithms like HAAG and CSI have better integration behaviour, but they are limited in 
their applicability. 

To summarise, a wide range of matrix element evaluation and integration techniques has been made 
available within the Sherpa framework. The different techniques have been tested and compared, 
such that for every given jet multiplicity an optimal performance of the overall package can be 
achieved. 
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A The VEGAS-improved Haag algorithm 

The Haag phase space generator [8j is designed to produce momenta distributed approximately 
according to a QCD antenna function for an n-particle process 

An{pO,Pl,-,Pn-l) = 1 TT ^-^^^ TT T- (5) 

[POPl ){PlP2)-- {Pn-2Pn-l ){Pn-lPo) 

Different antennae can be obtained from permutations of the momenta pi . Cyclic permutation and 
reversion of the order will however lead to the same structure. Generally HAAG relies on phase 
space factorisation over time-like intermediate momenta. In Ref. [8j two algorithms are proposed 
which are referred to as closed and open antenna and which differ in the decomposition of the 2- 
particle phase space d^2- Only the closed antenna contains all factors in Eq. ([5]), while in the open 
antenna one factor {piPi+i) is missing. Although the closed antenna seems to be more symmetric, 
in practice it turns out that the open antenna is more efficient. This is mainly due to its simpler 
structure and less additional weight factors that appear within the algorithmic In the following we 
will therefore focus on open antennas. The algorithm is reviewed for the case of massless external 
particles, however it can easily be generalised to the massive case. 

A.l Antenna Generation 

In the following we use a classification of antenna types by the position of the incoming momenta, 
Po and pi, within the antenna, see Fig. [H The type is then given by Min(m — l,n — m — 1). 

The basic building block for antenna generation is the split of a massive momentum according to 
the phase space element ds d^2{Q]P, P', q), where = s and the last argument, q, defines an axis 
for the momentum generation. We further decompose 

Q ' P 

d^2{Q',P, P', q) = dadcf) , where a = — — (6) 
and (p is an azimuthal angle around q. 

The phase space for a single split, now defined through the variables s, a, cp, is constructed as 
follow^: 



''mill) "max I 



1. Dice s according to 1/s in [smim Smaxj 

2. Dice a according to 1/a in [amim^E 

3. Dice (j) according to a flat distribution in [0, 2tt]. 

4. The momenta are given by 

p = 




P=^-=^,-p , (7) 



Q^{l-2a) - s\ , , /— ^— ^ 

p = \ hcos(p, hsincp, 1 , where n = y' Q''a[l — a) — as. 



^ These weights are nonsingular in any of the products (piPj). 

^ Frame dependent quantities are defined in the CM frame of Q with the 2:-Axis along q 
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Process 






Time per ME 


Time per ME 










AMEGIC++ [s] 


CSW [s] 


AMEGIC++/CSW 


99 2c/ 






6.35 X 10-6 


1.78 X 10-6 


3.6 


yy ^ ^y 






1 68 X 10"^ 


1.73 X 10-^ 


9.7 


nn — > An 
yy ^ ^y 






3 63 X 10"^ 


1.18 X 10-^ 


31 


nn — > F\n 

yy ^ '-'y 








3 29 X 10"^ 




no — y f\n 

yy ^ ^y 








4.56 




nn — > In 
yy ' y 








280 




qq 2g 






3.25 X 10-6 


1.45 X 10-6 


2.2 


qq oy 






S 40 X 1 0"^ 


1.21 X 10-^ 


2 8 


nn — > An 






2 06 X 10"^ 


8 80 X 10"*^ 


2.3 


nn — ^ n 
qq oy 






614 


2 6.'i X 1 0"^ 


23 


nn — ^ (\n 
qq f \jy 








2.96 




QQ — > 7q 








170 




qq qq 






1.36 X 10-6 


2.76 X 10-6 


0.49 


nn — ^ nn n 
HH ^ HH y 






1.11 X IQ-^ 


1.15 X 10-^ 


1.0 


nn — ^ nn On 
qq ^ qq ^y 






4.48 X IQ-^ 


5 30 X 10^'^ 


0.85 


nn — V nn "^^n 
qq ? qq oy 






8 98 X 10-^ 


1.12 X 10^2 


8.0 


nn — y nn An 
qq ^ qq ^y 








0.934 




QQ — > QQ 5(7 
HH HH ^y 








42.0 




qq q'q^ 






7.99 X 10-'^ 


1.52 X 10-6 


0.53 


yy ' y y y 






5.74 X 10-6 


5 65 X 10-6 


1.0 


nn — ^ 

yy ' y y 






1 07 X 10-^ 


2 72 X 1 0-*^ 


39 


nn — > fi^ n^ ^n 

yy ' y y "-"y 






1.34 X 10-^ 


5 80 X 10-'^ 


2.3 


nn — ^ n^ A.n 
qq ' q q 








470 




qq q'q' hg 






- 


20.94 




qq Z{-^ e- 


-e^ 


-) 


1.56 X 10-6 


3.88 X 10-6 


0.40 


qq e' 






3.99 X 10-6 


6.85 X 10-6 


0.58 


qq Z{^ 


-e^ 


) 25 


2.16 X 10-^ 


1.07 X 10"^ 


0.20 


qq — > ^(— > e" 


-e^ 


") 3g 


2.34 X 10-^ 


1.31 X 10-3 


0.18 


qq Z{-^ e" 


-e^ 


")45 


1.44 X 10-2 


8.20 X 10-2 


0.18 



Tab. 1 Computation times for full matrix elements summed over colour and he- 
licity. Displayed are averages for a single evaluation, employing Feynman 
diagrams computed in the helicity formalism (using AMEGIC++) and the 
Cachazo-Svrcek-Witten (CSW) vertex rules. Numbers were generated on 
a 2.53 GHz Intel® CoreTM2 Duo T9400 CPU. 
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Process 



Time per ME 
BG [si 



Time per ME 
CSW [s] 



99 
99 
99 
99 
99 
99 



2ff 

39 

79 



8.42 X 10"^ 
3.19 X 10-5 
1.13 X 10"^ 
3.58 X 10-^ 
1.17 X 10-'^ 
3.99 X 10-3 



1.18 X 10"^ 
3.31 X 10-6 
6.09 X 10-5 
2.91 X 10-^ 
6.38 X 10-3 
5.66 X 10-2 



qq^2g 

qq^ 2>g 
qq 4g 

qq 53 

qq^6g 
Qq^7g 



6.20 X 10-6 
2.18 X 10-5 
6.91 X 10-5 
2.15 X lO-'' 
6.53 X 10-4 
2.03 X 10-3 



1.02 X 10-6 
2.46 X 10-6 
4.59 X 10-5 
2.34 X 10-4 
4.00 X 10-3 
3.11 X 10-2 



qq 



qq 
qq 



qq 

qq 9 
qq 25 
qq 3g 
qq^9 
qq 55 



2.86 X 10-6 

1.17 X 10-5 
4.99 X 10-5 
1.94 X 10-4 
7.16 X 10-4 
2.86 X 10-3 



1.56 X 10-6 
3.26 X 10-6 

5.92 X 10-5 
2.90 X 10-4 

4.93 X 10-3 
3.69 X 10-2 



qq 



qq 
qq 
qq 



q'q! 
q'q^ g 

q'q! 2g 
q'q^ 3g 

g'g' hg 



2.24 X 10-6 
8.97 X 10-6 
2.87 X 10-5 
8.18 X 10-5 
2.70 X 10-4 
8.13 X 10-4 



1.06 X 10-6 
1.96 X 10-6 
3.39 X 10-5 
1.55 X 10-4 
2.48 X 10-3 
1.84 X 10-2 



qq 
qq 

qq 
qq 
qq 



Z{- 

Z{- 
Z{- 
Z{- 



e e ' 



e e ' 



e e 



1 9 

") 35 
-)^9 



3.84 X 10-6 
1.02 X 10-5 
2.57 X 10-5 
7.06 X 10-5 
1.95 X 10-4 



3.88 X 10-6 
6.85 X 10-6 
6.90 X 10-5 
2.95 X 10-4 
3.72 X 10-3 



Tab. 2 Average computation time of partial amplitudes in multi-gluon scatter- 
ing, summed over all helicity configurations. Displayed are averages for 
a single evaluation, employing colour dressed Berends-Giele (BG) re- 
cursion and the Cachazo-Svrcek-Witten (CSW) vertex rules. Numbers 
were generated on a 2.53 GHz Intel® Core'^'^2 Duo T9400 CPU. 
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Parameter 


Value 


EW parameters in the scheme 


Gf 


1.16639 X 10-5 


CtQED 


1/132.51 


sin^ 9w 


0.2222 


Mz 


91.188 GeV 


ruH 


120 GeV 


QCD parameters 


PDF set 


CTEQ6L1 


as 


0.130 




Mz 


jet, initial parton 


g, u, d, s, c 



Parameter 



Value 



Widths (fixed width scheme) 



Tz 



2.446 GeV 



Cuts 



\Vi\ 



> 30 GeV 

< 5 



66 GeV < mil < 116 GeV 
CDF Run II algorithm 
with kr > 30 GeV and D=0.7 



Tab. 3 Parameters for the integration time comparison. 



pp ^ n jets 
gluons only 


n = 2 


n = 3 


n = 4 


n = 5 


n = 6 


MC cross section [pb] 


8.915 • 10^ 


5.454 • 10^ 


1.150 • 10^ 


2.757 • 10^ 


7.95 • 10^ 


stat. error 


0.1% 


0.1% 


0.2% 


0.5% 


1% 




integration time for given stat. error [s] 


CSW (HAAG) 


4 


165 


1681 


12800 


2 • 10*^ 


CSW (CSI) 




480 


6500 


11900 


197000 


AMEGIC (HAAG) 


6 


492 


41400 






COMIX (RPG) 


159 


5050 


33000 


38000 


74000 


COMIX (CSI) 




780 


6930 


6800 


12400 



Tab. 4 Cross section and evaluation times for different matrix element (phase space) generation methods 
for multi-gluon scattering at the LHC, given in pb. Numbers were generated on a 2.53 GHz 
Intel® Core'^^2 Duo T9400 CPU. For cuts and parameter settings, cf. Tab.H 



10 



Tl7~i ^ T7 If^ft; 

<C 1 nnarW linp 


n = 2 


n = 3 


n = 4 


?i = 5 


n = 6 


IVTO cross sprtion \vi\)\ 


1.456 • 10^ 


1.051 • lO'^ 


2.490 • 10^ 


6.75 • 10^ 


2.14 • 10^ 


stat. error 


0.1% 


0.1% 


0.2% 


0.5% 


1% 




integration time for given stat. error [s] 


CSW (HAAG) 


10 


354 


6980 


60000 


9 • 10^ 


AMEGIC (HAAG) 


13 


930 


73000 






COMIX (RPG) 


254 


5370 


15900 


36800 


64100 




— > n jets 
< 2 quark lines 


n = 2 


n = 3 


n = 4 


n = 5 


n = 6 


MC cross section [pb] 


1.5129 • 10^ 


1.1198 • lO'^ 


2.831 • 10^ 


8.12 ■ 10^ 


2.71 • 10^ 


stat. error 


0.1% 


0.1% 


0.2% 


0.5% 


1% 




integration time for given stat. error [s] 


CSW (HAAG) 


16 


730 


12300 


120000 


2 • 10^7 


AMEGIC (HAAG) 


19 


1530 


78000 






COMIX (RPG) 


525 


10800 


25600 


59000 


113000 



Tab. 5 Cross section and evaluation times for different matrix element (phase space) generation meth- 
ods for multi-jet production at the LHC, given in pb. Numbers were generated on a 2.53 GHz 
Intel® Core™2 Duo T9400 CPU. For cuts and parameter settings, cf. Tab.d 



pp — > Z+ jets 


n = 


n = 1 


n = 2 


n = 3 


n = 4 


MC cross section [pb] 


1080.8 


121.67 


54.67 


23.59 


11.22 


stat. error 


0.1% 


0.1% 


0.1% 


0.2% 


0.5% 




integration time for given stat. error [s] 


CSW (MC) 


12 


210 


4100 


57000 


1500000 


AMEGIC (MC) 


7 


98 


1060 


10400 


310000 


COMIX (RPG) 


15 


364 


6400 


16400 


54000 



Tab. 6 Cross section and evaluation times for different matrix element (phase space) generation methods 
for Z+jet production at the LHC, given in pb. Numbers were generated on a 2.53 GHz Intel® 
Core'^^2 Duo T9400 CPU. For cuts and parameter settings, cf. Tab.O 
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Po 



^ ij>Q-P2)ijp2-pz)-(jPm-Pl) (Pl -Pm+l ) - (Pn-2 "Pn- 1 ) ^n-l ) 



Pm+l 

Fig. 1 Antenna configuration. 



5. The weight is given by 

) wnere yi-'^min; -t-maxj — iog . \oj 

Type antennae 

The phase space for type antenna configurations can be obtained by a direct multiple application 
of the basic building block: 

d$n(P0,Pi;P2,-",Pn-l) = dS2 d$2(P0 +Pi;P2,(32;Pl) 

X dS3 d$2(<32;P3,Q3;P2) 

; (9) 

X dSn-3 d$2(Qn-4;?'n-3, (3n-3;Pn-4) 
X d$2(Q n— 3) Pn— 1) Pn— 2) Pn— 3 j • 

The corresponding total weight is given by 

w^zlpji^U+ip) 1 

W ~ ^ ^ 7 r -. , (10) 

n"~3 (E"~^Pi) '.^'l'^'2)(P2-P3) • • • (.Pn-2-Pn-l) 

where the contributions from boundary dependent functions g have been omitted. 
Type 1 antennae 

For this configuration the following phase space decomposition is considered: 

d^n(P0,Pi;P2,-",]3n-l) = dS2 d$2(P0 + Pi; P2, (32; Po) 

X dS3 d$2(<32;P3,Q3;Pl) 
X dS4 d^>2(<33;P4,Q4;P3) 

(11) 

X ds„_3 d<52((3n-4;Pn-3i <3 rt— 3; Pn—A) 
X d$2(Q n—3'iPn—liPn—2'iPn—3) • 
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In the first momentum split, d^2{po+PiTP2, Q2',Po), the variable a is now diced according to ^p^^- 
All following splits are generated according to the basic building block. The corresponding total 
weight is given by 

nn— 3 / 1 \ 

j=3Pj {22i=j+lPi) 1 

w '-^ Po{po+pi - P2) PliPo + Pi -P2) ; -2—7 77 ^ 7 (^^) 

n"~3 (Yy'^Pi) '^^O'^^jbrPa) • • • {Pn-2-Pn-l) 

Type k (> 2) antennae 

In this case we have the following decomposition: 

d$„(po,Pi;P2, •••,Pn-l) = dS2dSk d^2{PQ + Pl]P2,Qk;Po) 

X dS3 d^>2((52;p3,Q3;po) 

X dSk-2 d^2{Qk~3;Pk-2,Qk-2;Pk-3) 

X d^2{Qk-2;Pk-i,Pk;Pk-2) (13) 
X dsk+1 d^2{Qk;Pk+i,Qk+i;pi) 

n— 3) Pn—A) 

X d^2{Q 

All splittings are generated according to the basic building block. The corresponding total weight 
is given by 

ilj=2P] \l^i=j+lP^] ilj=k+lP3 \l^i=j+lP' 

Pl\Pk+l + ■■■+Pn-l) 7 -2^ 7^ -2 

rfe — 2 / V— vfc \ T-rn — 3 / v— vn— 1 



j=2 yl^i=jPij llj=fc+l \l^i=j Pi 

1 



{P0-P2){P2-P3) ■ ■ ■ {Pk^l-Pk){Pl-Pk+l) ■ ■ ■ {Pn-2-Pn-l) ' 

A. 2 Haag and variance reducing techniques 

To generate an adequate phase space integrator for realistic n-particle QCD processes, different 
Haag channels can be combined using the multi-channel method [20]. Symbolically we can write 
a single channel as a map X from uniformly distributed random numbers a € [0, to the 
four-momenta p = (pi, . . . ,pn) of external particles. The corresponding phase space weight g is 
given by 

1 ^ dc^„(X(fl)) 
9 da 

The multi-channel method now combines several maps Xi to a new map as follows: 

fc-l k 

X(a, d) = Xi^{a) , for ^^Oi < d < ^^o; j (15) 

1=1 1=1 

requiring an additional random number a and arbitrary coefficients with > and ^^j, = 1. 
The corresponding phase space weight is given by 

G = ^akgk- (16) 
k 
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The coefficients can be adapted such that the variance of the phase space integral is minimised. 

The efficiency of the integrator is improved if additionally the VEGAS algorithm [9] is applied to the 
single channels. VEGAS is very efficient in the numerical adaptation to functions, whose peaking 
behaviour is not too extreme and which are factorisable to a product of one-dimensional functions. 
Although this is usually not the case for a full differential cross section, it can be used to better 
adapt the antenna-like structures in a single HAAG-channel to the corresponding structures in the 
matrix elements, including phase space cuts. 

For each channel k, VEGAS is used to generate a mapping from uniformly distributed random 
numbers to a non-uniform distribution, still inside the interval [0, 1], and a corresponding weight 
Vk- To combine this with the multi-channel method the mapping X{a) for single channels must be 
invertible, which is the case for HA AG channels. The full map reads 

fc-i k 

X(a,Q) = Xk{^k{a)) , for '^ai<a<^ai. (17) 

1=1 1=1 

For a momentum configuration p the weight is therefore given by 
G{p} =Y.at Okip) vt{X^\p)) . (18) 

k 

We can make use of the equivalence of HAAG-channels of the same type, such that all of them 
employ the same VEGAS map. This alleviates the adaptation significantly, since we are left with 
only very few maps and a linear growth with the number of particles. 
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